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\ SEQUENCE LISTING 

GENERAL INFORMATION: 

(i) \ APPLICANT: Somerville, Chris 

\ Broun, Pierre 

\ van de Loo, Frank 

\ ' Boddupalli, Sekhar S. 

(ii) TITLE OF INVENTION: Production of Hydroxylated 
Fatty Acids in Genetically Modified Plants 

(iii) NUMBER OF SEQUENCES: 15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PILLSBURY MADISON & SUTRO 

(B) STREET: 1100 NEW YORK AVENUE, N.W. 

(C) CITY: WASHINGTON 

(D) S-IATE: D.C. 

(E) COUNTRY: USA 

(F) ZIR: 20005-3918 

(v) COMPUTER >READABLE FORM: 

(A) MEDIUM TYPE: 3,5 inch, 1.44 MB storage 

<B) COMPUTER: IBM compatible 

(C) OPERATING SYSTEM: DOS 5.0 

<D) SOFTWARE: Word Perfect 5,1 

<vi) CURRENT APPLICATION DATA; 

(A) APPLICATION NUMBER: not yet assigned 
<B) FILING DATE: February 6, 1997 
<C) CLASSIFICATION: 



(2) INFORMATION FOR SEQ ID NO : 1 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 Wcleot ides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS :\ single 

(D) TOPOLOGY: linfear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 1 : 



TATTGGCACC 


GGCGGCACCA 


TTCQA 


ACAAT 


GGATCCCTAG 


40 


AAAAAGATGA 


AGTCTTTGTC 


CCACO 


TAAGA 


AAG CTGCAGT 


80 


CANATGGTAT 


GTCAAATACC 


TCAAC 


WACCC 


TCTTGGACGC 


120 


ATTCTGGTGT 


TAACAGTTCA 


GTTTATQCTC 


GGGTGGCCTT 


160 
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TGTATOTAGC 


ctttaatgta 


TCAGGTAGAC 


CTTATGATGG 


200 


TTTCGCTTCA 


CATTTCTTCC 


CTCATGCACC 


TATCTTTAAG 


240 


gaccgtgaac 


GTCTCCAGAT 


ATACATCTCA 


GATGCTGGTA 


280 


ttctagctgV 4 


CTGTTATGGT 


CTTTACCGTT 


ACGCTGCTTC 


320 


acaaggattg\ 


ACTGCTATGA 


TCTGCGTCTA 


CGGAGTACCG 


360 


cttttgatag 


TGAACTTTTT 


CCTTGTCTTG 


GTCACTTTCT 


400 


tgcagcacac 


TCATCCTTCA 


TTACCTCACT 


ATGATTCAAC 


440 


cgagtgggaa 


TQGATTAGAG 


GAGCTTTGGT 


TACGGTAGAC 


480 


agagactatg 


GAATCTTGAA 


CAAGGTGTTT 


CACAACATAA 


520 


cagacaccca 


CGTAGCACAC 


CAC 




543 



(2) 



INFORMATION FOR SEQ ID NO : 2 



9>. 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: \544 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDlfesS : single 

(D) TOPOLOGY :\ linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 

TATAGGCACC GGAGGCACCA\ TTCCAACACA GGATCCCTCG 4 0 

AAAGAGATGA AGTATTTGTC CCAAAGCAGA AATCCGCAAT 80 

CAAGTGGTAC GGCGAATACC TCAACAACCC TCCTGGTCGC 120 

ATCATGATGT TAACTGTCCA GTTCGTCCTC GGATGGCCCT 160 

TGTACTTAGC CTTCAACGTT TCTjGGCAGAC CCTACAATGG 200 

TTTCGCTTCC CATTTCTTCC CCAATGCTCC TATCTACAAC 240 

GACCGTGAAC GCCTCCAGAT TTACATCTCT GATGCTGGTA 280 

TTCTAGCCGT CTGTTATGGT CTTTACCGTT ACGCTGTTGC 3 20 

ACAAGGACTA GCCTCAATGA TCTGTCKAAA CGGAGTTCCG 360 

CTTCTGATAG TTAACTTTTT CCTCGTCTTG AT CACTT ACT 400 
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TACAACACAC TCACCCTGCG TTGCCTCACT ATGATTCATC 440 

AGAGTGGGAT TGGCTTAGAG GAGCTTTAGC TACTGTAGAC 48 0 

agagactatg\gaatcttgaa CAAGGTGTTC CATAACATCA 52 0 

CAGACACCCA GGTCGCACAC CACT 544 

(2) INFORMATION FOR SEQ ID NO: 3 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1855 nucleotides 

(B) TYPE :\ nucleotide 

(C) STRANIDEDNESS : single 
ID) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATGAAGCTTT ATAAGAi^GTT AGTTTTCTCT GGTGACAGAG 4 0 

AAATTNTGTC AATTGGTAGT GACAGTTGAA GCAACAGGAA 8 0 

CAACAAGGAT GGTTGGTGNT GATGCTGATG TGGTGATGTG 120 

TTATTCATCA AATACTAAATy ACTACATTAC TTGTTGCTGC 160 

CTACTTCTCC TATTTCCTCC GCCACCCATT TTGGACCCAC 2 00 

GANCCTTCCA TTTAAACCCT OTCTCGTGCT ATTCACCAGA 24 0 

AG AGAAGC CA AGAGAGAGAG AQAGAGAATG TTCTGAGGAT 280 

CATTGTCTTC TTCATCGTTA TT^ACGTAAG TTTTTTTTGA 320 

CCACTCATAT CTAAAATCTA GTAQATGCAA TAGATTAATG 360 

ACTGTTCCTT CTTTTGATAT TTTCAGCTTC TTGAATTCAA 400 

GATGGGTGCT GGTGGAAGAA TAATG^TTAC CCCCTCTTCC 44 0 

AAGAAATCAG AAACTGAAGC CCTAaAcGT GGACCATGTG 4 80 

AGAAACCACC ATT CACTGTT AAAGATC^GA AGAAAGCAAT 52 0 

CCCACAGCAT TGTTTCAAGC GCTCTATCGC TCGTTCTTTC 560 

TCCTACCTTC TCACAGATAT CACTTTAGTTt TCTTGCTTCT 600 

ACTACGTTGC CACAAATTAC TTCTCTCTTCl TTCCTCAGCC 64 0 
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TCTCTCTACT 


TACCTAGCTT 


GGCCTCTCTA 


TTGGGTATGT 


680 


CAAGGCTGTG 


TCTTAACCGG 


TATCTGGGTC 


ATTGGCCATG 


72C 


AATGTGGTCA 


COATGCATTC 


AGTGACTATC 


AATGGGTAGA 


760 


TGACACTGTT 


GGmTTATCT 


TCCATTCCTT 


CCTTCTCGTC 


800 


CCTTACTTCT 


CCTGGAAATA 


CAGTCATCGT 


CGTCACCATT 


840 


CCAACAATGG 


ATC1TCTCGAG 


AAAGATGAAG 


TCTTTGTCCC 


880 


AC CGAAGAAA 


GCTGGAGTCA 


AATGGTATGT 


TAAATACCTC 


920 


AACAACCCTC 


TTGGACGCAT 


TCTGGTGTTA 


ACAGTTCAGT 


960 


TTATCCTCGG 


GTGGCQTTTG 


TATCTAGCCT 


TTAATGTATC 


1000 


AGGTAGACCT 


TAT GAT GGTT 


TCGCTTCACA 


TTTCTTCCCT 


1040 


CATGCACCTA 


TCTTTAAAGA 


CCGAGAACGC 


CTCCAGATAT 


1080 


ACATCTCAGA 


TGCTGGTATT 


CTAGCTGTCT 


GTTATGGTCT 


1120 


TTACCGTTAC 


GCTGCTTCAu 


AAGGATTGAC 


TGCTATGATC 


1160 


TGCGTCTATG 


GAGTACCGCT 1 


i TTTGATAGTG 


AACTTTTTCC 


1200 


TTGTCTTGGT 


AACTTTCTTG 


CAGCACACTC 


ATCCTTCGTT 


1240 


ACCTCATTAT 


GATTCAACCG 


Agtgggaatg 


GATTAGAGGA 


1280 


GCTTTGGTTA 


CGGTAGACAG 


AGACTATGGA 


ATATTGAACA 


1320 


AGGTGTTCCA 


TAACATAACA 


GACACACATG 


TGGCTCATCA 


1360 


TCTCTTTGCA 


ACTATACCGC 


ATTATAACGC 


AATGGAAGCT 


1400 


ACAGAGGCGA 


TAAAGCCAAT 


ACTTGGTGAT 


TACTACCACT 


1440 


TCGATGGAAC 


ACCGTGGTAT 


GTGGCCATGT 


ATAGGGAAGC 


1480 


AAAGGAGTGT 


CTCTATGTAG 


AACCGCATAC 


GGAACGTGGG 


1520 


AAGAAAGGTG 


T CT ACT ATT A 


CAACAATAAG 


TTATGAGGCT 


1560 


GATAGGGCGA 


GAGAAGTGCA 


ATTATC^ATC 


TTCATTTCCA 


1600 


TGTTTTAGGT 


GTCTTGTTTA 


AGAAGCTATG 


CTTTGTTTCA 


1640 


AT AAT CTCAG 


AGTCCATNTA 


GTTGTGTT^T 


GGTGCATTTT 


1690 
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GCCTAGTTAT IpTGGTGTCGG AAGTTAGTGT TCAAACTGCT 1720 

TCCTGCTGTG GTGCCCAGTG AAGAACAAGT TTACGTGTTT 17 60 

AAAATACTCG G?VACGAATTG ACCACAANAT ATCCAAAACC 1800 

GGCTATCCGA ATfTCCATATC CGAAAACCGG ATATCCAAAT 184 0 

TTCCAGAGTA CT^AG 18 55 



(2) 



INFORMATION FOR SEQ ID NO : 4 



(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 384 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGtY; linear 

fxi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 

Met Gly Ala Gly Gl* Arg He Met Val Thr 

5 \ 10 

Pro Ser Ser Lys Lys Wr Glu Thr Glu Ala 

15 \ 20 

Leu Lys Arg Gly Pro cVs Glu Lys Pro Pro 

25 \ 30 

Phe Thr Val Lys Asp Leu Lys Lys Ala lie 

35 \ 40 

Pro Gin His Cys Phe Lys\Arg Ser He Pro 

45 \ 50 

Arg Ser Phe Ser Tyr Leu Leu Thr Asp He 

55 \ 60 

Thr Leu Val Ser Cys Phe Tyr Tyr Val Ala 

65 \ 70 



Thr Asn Tyr Phe Ser Leu Leu Pro Gin Pro 

75 \ 80 

Leu Ser Thr Tyr Leu Ala Trp\ Pro Leu Tyr 

85 \ 90 

Trp Val Cys Gin Gly Cys Val Leu Thr Gly 

95 \ 100 
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lie Trp Va^ 
His Ala Phel 
Asp Thr Val 
Leu Leu Val 
Ser His Arg 
Ser Leu Glu 
Pro Lys Lys 
Lys Tyr Leu 
Leu Val Leu 
Trp Pro Leu 
Gly Arg Pro 
Phe Phe Pro 
Arg Glu Arg 
Ala Gly lie 
Tyr Arg Tyr 
Ala Met lie 
Leu lie Val 




Thr Val 
185 

Tyr Leu 
195 

Tyr Asp 
205 

His Ala 
215 

Leu Gin 
225 

Leu Ala 
235 

Ala Ala 
245 

Cys Val 
255 

Asn Phe 
265 




Gly His 
110 

Val Asp 
120 

Ser Phe 
130 

Lys Tyr 
140 

Asn Gly 

: 150 

Val Pro 
160 

Tyr Val 
170 

Arg lie 
180 

Leu Gly 
190 

Val Ser 
200 

Ser His 
210 

Lys Asp 
220 

Ser Asp 
230 

Gly Leu 
240 

Leu Thr 

250 

Pro Leu 
260 

Leu Val 
270 
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Thr Phe Leu\ Gin His Thr His 

275 



Pro Ser Leu 
280 



Pro His Tyr I\sp Ser Thr Glu 

285 



Trp Glu Trp 
290 



lie Arg Gly A3\a Leu Val Thr 

295 



Val Asp Arg 
300 



Asp Tyr Gly Ile\ Leu Asn Lys 

305 



Val Phe His 
310 



Asn lie Thr Asp \hr His Val 

15 



Ala His His 
320 



Leu Phe Ala Thr lie Pro His 

32^ 



Tyr Asn Ala 
330 



Met Glu Ala Thr Glu\ Ala lie 

335 



Lys Pro lie 
340 



Leu Gly Asp Tyr Tyr IJis Phe 

345 



Asp Gly Thr 
350 



Pro Trp Tyr Val Ala Me£ Tyr 

355 



Arg Glu Ala 
360 



Lys Glu Cys Leu Tyr Val\Glu 

365 



Pro Asp Thr 
370 



Glu Arg Gly Lys Lys Gly X al 

375 



Tyr Tyr Tyr 
380 



Asn Asn Lys Leu 



(2) INFORMATION FOR SEQ ID NO: 5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 87 amino lac ids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEfc ID NO : 5 : 

Met Gly Gly Gly Gly Arg Met Ser\ Thr Val 

5 \ 10 



lie Thr Ser Asn Asn Ser Glu Lys pys Gly 

15 1 20 
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Pro His 
30 

Asp Leu 
40 

Phe Glu 
50 

Tyr Val 
60 

Leu Phe 
70 

Pro Tyr 
80 

Ala Trp 
90 

Cys lie 
100 

His Glu 
110 

Tyr Gin 
120 

lie Val 
130 

Phe Ser 
140 

His Ser 
150 

Glu Val 
160 

lie Ser 
170 

Pro Pro 
180 

Thr Leu 
190 
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Leu 


Leu 


Ply 


Trp 


Pro 
195 


Leu 


Tyr 


Leu 


Ala 


Phe 
200 


Asn 


Val 


Ser 


Gly 


Arg 
205 


Pro 


Tyr 


Asp 


Arg 


Phe 
210 


Ala 


Cys 


His\ 


Tyr 


Asp 
215 


Pro 


Tyr 


Gly 


Pro 


He 
220 


Phe 


Ser 


Glu 


Arg 


Glu 
225 


Arg 


Leu 


Gin 


He 


Tyr 
230 


He 


Ala 


Asp 


Le\u 


Gly 
235 


lie 


Phe 


Ala 


Thr 


Thr 
240 


Phe 


Val 


Leu 


Tyr\ 


Gin 
\245 


Ala 


Thr 


Met 


Ala 


Lys 
250 


Gly 


Leu 


Ala 


Trp 


Val 
255 
\ 


Met 


Arg 


He 


Tyr Gly 
260 


Val 


Pro 


Leu 


Leu 


n\ 

265V 


Val 


Asn 


Cys 


Phe 


Leu 
270 


Val 


Met 


He 


Thr 


Tyr 
275 


Y,eu 


Gin 


His 


Thr 


His 
280 


Pro 


Ala 


He 


Pro 


Arg 
285 


tW 


Gly 


Ser 


Ser 


Glu 
290 


Trp 


Asp 


Trp 


Leu 


Arg 
295 


Gl\ 


Ala 


Met 


Val 


Thr 
300 


Val 


Asp 


Arg 


Asp 


Tyr 
305 


Gly Wal 


Leu 


Asn 


Lys 
310 


Val 


Phe 


His 


Asn 


He 
315 


Ala 


Asp 


Thr 


His 


Val 
320 


Ala 


His 


His 


Leu 


Phe 
325 


Ala 


Thr 


Val 


Pro 


His 
330 


Tyr 


His 


Ala 


Met 


Glu 
335 


Ala 


ThA 


Lys 


Ala 


He 
340 


Lys 


Pro 


He 


Met 


Gly 
345 


Glu 


Tyr 


Tyr 


Arg 


Tyr 
350 


Asp 


Gly 


Thr 


Pro 


Phe 
355 


T/r 


Lys 


Ala 


Leu 


Trp 
360 
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Arg GlVi Ala Lys Glu Cys Leu Phe Val Glu 
\ 365 370 

Pro Asp\Glu Gly Ala Pro Thr Gin Gly Val 
\ 375 380 

Phe Trp Tyr Arg Asn Lys Tyr 
\ 385 

(2) INFORMATION FOR SEQ ID NO : 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

<xi) SEQUENCE ^DESCRIPTION : SEQ ID NO: 6 

Met Gly Ala Glyy Gly Arg Met Pro Val Pro 

5 10 

Thr Ser Ser Lys Lys Ser Glu Thr Asp Thr 

V6 20 

Thr Lys Arg Val pAd Cys Glu Lys Pro Pro 

25\ 30 

Phe Ser Val Gly Asp\Leu Lys Lys Ala lie 

35 \ 40 



Pro Pro His Cys Phe iys Arg Ser He Pro 

45 \ 50 



Leu 



Arg Ser Phe Ser Tyr Leu He Ser Asp He 

55 \ 60 

He He Ala Ser Cys Phe Tyr Tyr Val Ala 

65 \ 70 

Thr Asn Tyr Phe Ser Leu\Leu Pro Gin Pro 

75 \ 80 

Leu Ser Tyr Leu Ala Trp Paro Leu Tyr Trp 

85 \ 90 

Ala Cys Gin Gly Cys Val Le^i Thr Gly He 

95 \ 100 

Trp Val He Ala His Glu Cys \dy His His 

105 \ HO 
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Ala Phe\ Ser Asp Tyr Gin Trp Leu Asp Asp 

115 120 

Thr Val tely Leu lie Phe His Ser Phe Leu 

125 130 

Leu Val Pi\o Tyr Phe Ser Trp Lys Tyr Ser 

135 140 

His Arg Arg\ His His Ser Asn Thr Gly Ser 

145 150 

Leu Glu Arg Asp Glu Val Phe Val Pro Lys 
\ 155 160 

Gin Lys Ser Ala lie Lys Trp Tyr Gly Lys 
\l65 170 

Tyr Leu Asn Asn Pro Leu Gly Arg lie. Met 

\75 180 

Met Leu Thr Val GM Phe Val Leu Gly Trp 

185 190 

Pro Leu Tyr Leu Ala\Phe Asn Val Ser Gly 

195\ 200 

Arg Pro Tyr Asp Gly Rhe Ala Cys His Phe 

2.05 \ 210 

Phe Pro Asn Ala Pro iTe Tyr Asn Asp Arg 

215 \ 220 

Glu Arg Leu Gin lie Tyr\ Leu Ser Asp Ala 

225 \ 230 

Gly lie Leu Ala Val Cys Phe Gly Leu Tyr 

235 \ 240 

Arg Tyr Ala Ala Ala Gin Gly Met Ala Ser 

245 \ 250 

Met lie Cys Leu Tyr Gly Val Pro Leu Leu 

255 \ 260 

He Val Asn Ala Phe Leu Val teu He Thr 

265 \ 270 



Tyr Leu Gin His Thr His Pro Sfer Leu Pro 

275 \ 280 
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His TVr 




Ser 
285 

Ala 
295 

Asn 
305 

His 
315 

Pro 
325 

Ala 
335 

Gin 
345 



Glu 
Thr 
Lys 
Val 
His 
He 
Phe 




Trp Asp 
Val Asp 
Val Phe 
Ala His 
Tyr Asn 
Lys Pro 
Asp Gly 
Arg Glu 
Pro Asp 
Tyr Trp 



Trp Leu 
290 

Arg Asp 
300 

His Asn 
310 

His Leu 
320 

Ala. Met 
" 330 

He Leu 
34 0 

Thr Pro 
350 

Ala Lys 

360 

Arg Glu 
370 

Tyr Asn 
380 



(2) INFORMATION \FOR SEQ ID NO : 7 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH : \ 3 8 4 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEPNESS : 

(D) TOPOLOGY^ linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Gly Ala Gly Gly\Arg Met Gin Val Ser 

5 V 10 

Pro Pro Ser Lys Lys Ser Glu Thr Asp Asn 

15 \ 20 

He Lys Arg Val Pro CMs Glu Thr Pro Pro 

25 \ 30 
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Phe Thr Val 




Leu Thr Val 



Leu Tyr Leu 



Lys Ala lie 
40 

Ser lie Pro 
50 

Trp Asp lie 
60 

Tyr Val Ala 
70 

Pro Asn Pro 

: 80 

Leu Tyr Trp 
90 

Thr Gly Val 
100 

Gly His Ala 
110 

Leu Asp Asp 
120 

Ser Phe Leu 
130 

Lys Tyr Ser 
140 

Thr Gly Ser 
150 

Val Pro Arg 
160 

Thr Ala Ser 
170 

Thr Val Met 
180 

Gly Trp Pro 
190 

Ser Gly Arg 
200 
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Pro Tyr\ Asp Gly Gly Phe Ala Cys His Phe 

205 210 

His Pro ^sn Ala Pro He Tyr Asn Asp Arg 

215 220 

Glu Arg lieu Gin He Tyr He Ser Asp Ala 

225 230 

Gly He Le\i Ala Val Cys Tyr Gly Leu Leu 

235 240 

Pro Tyr Ala\Ala Val Gin Gly Val Ala. Ser 

245 250 

Met Val Cys E^ie Leu Arg Val Pro Leu Leu 

255 260 

He Val Asn GlV Phe Leu Val Leu He Thr 

265 270 

Tyr Leu Gin His\Thr His Pro Ser Leu Pro 

\21S 280 

His Tyr Asp Ser Ser Glu Trp Asp Trp Leu 

mS 290 

Arg Gly Ala Leu Ala Thr Val Asp Arg Asp 

295 300 

Tyr Gly lie Leu Asn\Gln Gly Phe His Asn 

305\ 310 

He Thr Asp Thr His &lu Ala His His Leu 

315 \ 320 

Phe Ser Thr Met Pro His Tyr His Ala Met 

325 \ 330 

Glu Ala Thr Lys Ala lie Lys Pro He Leu 

335 \ 340 

Gly Glu Tyr Tyr Gin Phe \Asp Gly Thr Pro 

345 \ 350 

Val Val Lys Ala Met Trp Arg Glu Ala Lys 

355 \ 360 



Glu Cys He Tyr Val Glu Ptfo Asp Arg Gin 

365 \ 370 
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Gly Glu uys Lys Gly Val Phe Trp Tyr Asn 

375 380 

Asn Lys Leu Xaa 



(2) INFORMATION FOR SEQ ID NO : 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 09 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(xi) SEQUENCE ^DESCRIPTION: SEQ ID NO : 8 

Ser Leu Leu Th\ Ser Phe Ser Tyr Val Val 

5 10 

Tyr Asp Leu Ser \Phe Ala Phe lie Phe Tyr 

5 20 

lie Ala Thr Thr TVr Phe His Leu Leu Pro 

25 30 

Gin Pro Phe Ser Leu lie Ala Trp Pro He 

35\ 40 

Tyr Trp Val Leu GlmGly Cys Leu Leu Thr 

45 \ 50 

Arg Val Cys Gly His kis Ala Phe Ser Lys 

55 \ €0 

Tyr Gin Trp Val Asp A&p Val Val Gly Leu 

65 \ 70 

Thr Leu His Ser Thr Leu Leu Val Pro Tyr 

75 \ 80 

Phe Ser Trp Lys He Ser \ His Arg Arg His 

85 \ 90 

His Ser Asn Thr Gly Ser fieu Asp Arg Asp 

95 \ 100 

Glu Arg Val Lys Val Ala tAp Phe Ser Lys 

105 \ 110 



Tvr Leu Asn Asn Pro Leu Gly, Arg Ala Val 

115 \ 120 
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Ser Liju Leu Val Thr Leu Thr lie Gly Trp 

125 130 

Pro Me^: Tyr Leu Ala Phe Asn Val Ser Gly 

135 140 

Arg Pro\Tyr Asp Ser Phe Ala Ser His Tyr 

145 150 

His Pro ^yr Arg Val Arg Leu Leu lie Tyr 

155 160 

Val Ser A^p Val Ala Leu Phe Ser Val Thr 

165 : 170 

Tyr Ser Le\\ Tyr Arg Val Ala Thr Leu Lys 

175 180 

Gly Leu Val \Trp Leu Leu Cys Val Tyr Gly 

185 190 

Val Pro Leu L^u lie Val Asn Gly Phe Leu 

195 200 

Val Thr lie Th^ Tyr Leu Arg Val His Tyr 

205 210 

Asp Ser Ser Glu Vrp Asp Trp Leu Lys Gly 

15 220 

Ala Leu Ala Thr Met Asp Arg Asp Tyr Gly 

225 230 

He Leu Asn Lys Val Phe His His He Thr 

23a 240 

Asp Thr His Val Ala\His His Leu Phe Ser 

245 \ 250 

Thr Met Pro His Tyr kis Leu Arg Val Lys 

255 \ 260 

Pro He Leu Gly Glu TVr Tyr Gin Phe Asp 

265 \ 270 

Asp Thr Pro Phe Tyr Lyfe Ala Leu Trp Arg 

275 \ 280 



Glu Ala Arg Glu Cys Leu\Tyr Val Glu Pro 

285 \ 290 



97/30582 



PCT/US97/02187 



91 



Asp \lu Gly Thr Ser Glu Lys Gly Val Tyr 

Trp tAt Arg Asn Lys Tyr Leu Arg Val 
x 305 

(2) INFORMATION FOR SEQ ID NO: 9 

( i ) SEQUENCE CHARACTERISTICS : 

(A) \ LENGTH: 302 amino acids 

(B) \ TYPE: amino acid 
(C> \ STRANDEDNESS : 
(D) YrOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Phe Ser Tyr\val Val Tyr Asp Leu Thr lie 

Ala Phe Cys \eu Tyr Tyr Val Ala Thr His 

Tyr Phe His tiu Leu Pro Gly Pro Leu Ser 

Phe Arg Gly Met\ Ala He Tyr Trp Ala val 

Gin Gly Cys He \eu Thr Gly Val Trp Val 

Val Ala Phe Ser aL Tyr Gin Leu Leu Asp 

55 

Asp lie Val Gly Leu\ He Leu His Ser Ala 

6 5 

Leu Leu Val Pro Tyr \he Ser Trp Lys Tyr 

Ser His Arg Arg His hL Ser Asn Thr Gly 

85 \ 

Ser Leu Glu Arg Asp Glu\val Phe Val Pro 
Lys val Ser Lys Tyr Leu \sn Asn Pro Pro 
Oly Arg Val Leu Thr Leu A^a Val Thr Leu 
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Thr ^eu Gly Trp Pro Leu Tyr Leu Ala Leu 
125 130 

Asn V^l Ser Gly Arg Pro Tyr Asp Arg Phe 
135 140 

Ala Cys\ His Tyr Asp Pro Tyr Gly Pro lie 

145 150 

Tyr Ser Yal lie Ser Asp Ala Gly Val Leu 
155 160 

Ala Val V^l Tyr Gly Leu Phe Arg Leu Ala 
165 170 

Met Ala Lys\ Gly Leu Ala Trp Val Val Cys 

175 180 

Val Tyr Gly Val Pro Leu Leu Val Val Asn 

185 190 

Gly Phe Leu Val Leu lie Thr Phe Leu Gin 

195 200 

His Thr His Val\ Ser Glu Trp Asp Trp Leu 
205 210 

Arg Gly Ala Leu ^.la Thr Val Asp Arg Asp 

as 220 

Tyr Gly He Leu Asai Lys Val Phe His Asn 
225 230 

He Thr Asp Thr HisWal Ala His His Leu 
235 \ 240 

Phe Ser Thr Met Pro His Tyr His Ala Met 
245 \ 250 

Glu Ala Thr Val Glu Tyr Tyr Arg Phe Asp 
255 \ 260 

Glu Thr Pro Phe Val Lys\Ala Met Trp Arg 
265 \ 270 

Glu Ala Arg Glu Cys He wr Val Glu Pro 
275 \ 280 



Asp Gin Ser Thr Glu Ser Lyte Gly Val Phe 

285 \ 290 
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Trp Tyr\Asn Asn Lys Leu Ala Met Glu Ala 
295 300 

Thr Val 



(2) INFORMATION FOR SEQ ID NO: 10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 3 72 amino acids 

(B) TYPE: amino acid 

(C) SXRANDEDNESS : 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Met Gly Ala GUy Gly Arg Met Thr Glu Lys 
5 10 

Glu Arg Glu LyS^ Gin Glu Gin Leu Ala Arg 

15 20 

Ala Thr Gly Gly\Ala Ala Met Gin Arg Ser 

t25 30 

Pro Val Glu Lys fcro Pro Phe Thr Leu Gly 

\s 40 



Gin lie Lys Lys Ala He Pro Pro His Cys 

50 



4 \ 

Val 



Phe Glu Arg Ser Val Leu Lys Ser Phe Ser 
55 \ 60 



Tyr Val Val His Asp\Leu Val He Ala Ala 
65 \ 70 

Ala Leu Leu Tyr Phe kla Leu Ala He He 

75 \ 80 

Pro Ala Leu Pro Ser pWo Leu Arg Tyr Ala 
85 \ 90 

Ala Trp Pro Leu Tyr Trk He Ala Gin Gly 

95 \ 100 

Ala Phe Ser Asp Tyr Ser\Leu Leu Asp Asp 

105 \ 110 



Val Val Gly Leu Val Leu His Ser Ser Leu 

115 \ 120 
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Met yal Pro Tyr Phe Ser Trp Lys Tyr Ser 
125 130 

His A^-g Arg His His Ser Asn Thr Gly Ser 
135 140 

Leu Gl\\ Arg Asp Glu Val Phe Val Pro Lys 
145 150 

Lys Lys ^lu Ala Leu Pro Trp Tyr Thr Pro 
155 160 

Tyr Val T^r Asn Asn Pro Val Gly Arg Val 
165 : 170 

Val His Ile\ Val Val Gin Leu Thr Leu Gly 
175 180 

Trp Pro Leu ^yr Leu Ala Thr Asn Ala Ser 
185 190 

Gly Arg Pro T^ ?ro Ar 9 Pne A la Cys His 
195 200 

Phe Asp Pro Tyr\Gly Pro lie Tyr Asn Asp 

05 210 

Arg Glu Arg Ala Gin lie Phe Val Ser Asp 
2i5 220 

Ala Gly Val Val Alk Val Ala Phe Gly Leu 
225i 230 

Tyr Lys Leu Ala Ala\Ala Phe Gly Val Trp 
235 \ 240 

Trp Val Val Arg Val 'tyr Ala Val Pro Leu 
245 \ 250 

Leu lie Val Asn Ala TAp Leu Val Leu lie 
255 \ 260 

Thr Tyr Leu Gin His Thr\His Pro Ser Leu 
265 \ 270 

Pro His Tyr Asp Ser Ser <3lLu Trp Asp Trp 
275 \ 280 



Leu Arg Gly Ala Leu Ala Ttir Met Asp Arg 
285 1 290 
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Asp Tyr Gly lie Leu Asn Arg Val Phe His 

295 300 

Asn Ilif Thr Asp Thr His Val Ala His His 

305 310 

Leu Phe ^er Thr Met Pro His Tyr His Ala 

315 320 

Met Glu A^La Thr Lys Ala lie Arg Pro lie 

325 330 

Leu Gly AsJ? Tyr Tyr His Phe Asp Pro : Thr 

335 340 

Pro Val Ala\Lys Ala Thr Trp Arg Glu Ala 

345 350 

Gly Glu Cys 3^e Tyr Val Glu Pro Glu Asp 

355 360 

Arg Lys Gly Va\ Phe Trp Tyr Asn Lys Lys 

365 370 

Phe Xaa 



(2} I NFORMAT I ON \ FOR SEQ ID NO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 amino acids 

(B) TYPE: amiVio acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: \linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Trp Val Met Ala His Asp Cys Gly His His 
5 \ 10 

Ala Phe Ser Asp Tyr Gin Leu Leu Asp Asp 
15 \ 20 

Val Val Gly Leu lie Leu\His Ser Cys Leu 
25 \ 30 

Leu Val Pro Tyr Phe Ser i\rp Lys His Ser 
35 \ 40 



His Arg Arg His His Ser Asn Thr Glv Ser 
45 \ 50 
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Leu Glu 
Lys Lys 
Tyr Leu 
Thr He 
Pro Leu 
Arg Pro 
Asp Pro 
Glu Arg 
Gly Val 
Gin Leu 
Val Val 
Val Val 
Phe Leu 
His Tyr 
Arg Gly 
Tyr Gly 
He Thr 



A^g Asp Glu 
55 

Set Ser He 
65 

Asn \Asn Pro 
75 



Ala 



.1 Thr 
85 



Tyr Leu Ala 

V s 

Tyr Asp \ Arg 
\105 



y Pf c 



Tyr Gly Kro 
li5 



He Glu II 
12 

Leu Ala Val 
135 

Ala He Ala 
145 



Cys Val Tyr 
155 

Asn Ser Phe 

165 

Gin His Thr 

175 

Asp Ser Ser 
185 

Ala Leu Ala 
195 

He Leu Asn 
205 

Asp Thr Gin 

215 



Val Phe Val Pro Lys 
60 

Arg Trp Tyr Ser Lys 
70 

Pro Gly Arg He Met 
80 

Leu Ser Leu Gly Trp 
90 

Phe Asn Val Ser :Gly 
100 

Phe Ala Cys His Tyr 
110 

He Tyr Asn Asp Arg 
120 

Phe He Ser Asp Ala 
130 

Thr Phe Gly Leu Tyr 
140 



s Gly Leu Ala Trp 
150 



Gly Val Pro Leu Leu 
160 

Leu\Val Leu He Thr 
170 

His \pro Ala Leu Pro 
180 

Glu iVp Asp Trp Leu 
190 

Thr Val Asp Arg Asp 
200 

Lys Val ^>he His Asn 
210 

Val Ala rtis His Leu 
220 
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Phe\ Thr Met Pro 

INFORMATION FOR SEQ ID NO: 12 

IEQUENCE CHARACTERISTICS: 

k ) LENGTH: 20 nucleotides 
(fe) TYPE: nucleotide 
(C) STRANDEDNESS : single 
(DO TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GCTCTTTTGT GCGCTCATTC 2 0 

(2) INFORMATION FOR SEQ ID NO: 13 

"(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNESS: single 
<D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CGGTACCAGA AAAGGCCTTG 20 

(2) INFORMATION FOR SEQ ID NO: 14 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 1 20 nucleotides 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TAYWSNCAYM GNMGNCAYCA 20 

(2) INFORMATION FOR SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 Nucleotides 

(B) TYPE: nucleotide 

(C) STRANDEDNES^: single 

(D) TOPOLOGY: lihear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
RTGRTGNGCN ACRTGNGTRT G 21 



